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To  investigate  the  effects  of  implicit  solvents  on  peptide  structure  and  dynamics,  we  performed  extensive 
molecular  dynamics  simulations  on  the  penta-peptide  Cys-Ala-Gly-Gln-Trp.  Two  different  implicit  solvent 
models  based  on  the  CHARMM22  all-atom  force  field  were  used.  Structural  properties  of  the  peptide  such 
as  distributions  of  end-to-end  distances  and  dihedral  angles  obtained  from  molecular  dynamics  simulations 
with  implicit  solvent  models  were  in  a  good  agreement  with  those  obtained  from  a  previous  explicit  solvent 
simulation  using  the  same  force  field.  Representative  structures  observed  in  explicit  solvent  were  sampled  by 
implicit  solvent  models  but  with  different  relative  probabilities.  However,  we  observed  significant  differences 
in  dynamical  properties  in  explicit  and  implicit  solvent  models  when  we  used  traditional  methods  for  the 
temperature  control,  such  as  Nose— Hoover  or  Berendsen  thermostats.  The  explicitly  solvated  peptide  displayed 
the  slowest  dynamics  in  both  end-to-end  contact  formation  and  intrinsic  diffusive  motion  of  end-to-end 
distances.  A  closer  agreement  between  implicit  and  explicit  solvated  peptide  dynamics  was  observed  when 
Langevin  dynamics  with  a  friction  coefficient  of  10  ps-1  was  used  to  maintain  the  temperature  of  the  systems. 


Introduction 

Molecular  dynamics  (MD)  simulation  techniques  are  widely 
used  to  study  structure  and  dynamics  of  biomolecular  systems 
in  atomic  detail.  An  accurate  and  efficient  treatment  of  solvation 
continues  to  be  a  challenge  in  simulations  of  biomolecular 
systems.  The  development  of  computationally  efficient  methods 
such  as  the  particle  mesh  Ewald  method1'2  for  calculating  long- 
range  electrostatic  forces  contributed  to  a  wider  adoption  of 
explicit  representation  of  water  molecules  in  simulations  of 
biological  macromolecules.  However,  the  computational  cost 
of  employing  explicit  solvent  simulations  is  still  significant.  In 
addition,  the  large  number  of  degrees  of  freedom  due  to  explicit 
solvent  water  molecules  poses  significant  challenges  in  applying 
advanced  sampling  techniques  such  as  replica-exchange  MD.3-6 
Various  implicit  solvent  models7-13  have  been  developed  as  an 
alternative  to  expensive  explicit-solvent  simulations.  These 
implicit  solvent  models  have  been  mainly  used  to  explore 
structural  properties  of  proteins.  Zhou  and  Berne14  compared 
the  free  energy  landscapes  of  the  C-terminal  /3-hairpin  of  protein 
G  obtained  by  explicit  and  implicit  solvent  simulations  based 
on  the  OPLS-AA15  force  field  with  an  efficient  conformational 
sampling  by  a  highly  parallel  replica-exchange  method.  They 
found  that  the  free  energy  landscape  from  the  surface-general¬ 
ized  Bom  (SGB)12  implicit  solvent  model  is  quite  different  from 
that  of  the  explicit  solvent  model.  In  the  free  energy  landscape 
calculated  by  the  SGB  implicit  solvent  model,  the  lowest  free 
energy  state  was  not  the  native  /3-strand  structure  observed  in 
the  explicit  solvent  simulation  and  the  experiments.  They 
attributed  this  behavior  to  an  overly  strong  salt  bridge  effect16'17 
between  charged  residues  in  the  SGB  implicit  solvent  model. 
Felts  et  al.18  obtained  a  better  agreement  with  the  explicit  solvent 
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results  by  using  the  effective  potential  consisting  of  the  OPLS- 
AA  force  field  and  the  analytical  generalized  Bom  plus  nonpolar 
(AGBNP)13  implicit  solvent  model.  Increased  dielectric  screen¬ 
ing  incorporated  in  the  AGBNP  model  reduced  the  stability  of 
incorrectly  formed  salt  bridges,  which  tend  to  disrupt  the 
formation  of  the  /3-hairpin.  This  underscores  that  a  proper 
balance  between  the  nonpolar  effective  potential  terms,  which 
drive  compaction,  and  the  electrostatic  interactions,  which 
promote  secondary  structure  formation  in  the  implicit  solvent 
model,  has  to  be  made  to  reproduce  structural  properties 
observed  with  the  explicit  solvent  simulations  or  experiments.18 
Fan  et  al.19  compared  performances  of  three  different  generalized 
Born  (GB)  models  in  reproducing  the  backbone  root-mean- 
square  deviation  from  the  native  structure,  the  number  of 
hydrogen  bonds  retained  in  the  simulation,  and  the  experimental 
and  calculated  radius  of  gyration  for  10  different  proteins. 
Recently,  Hamelberg  et  al.20  probed  the  role  of  hydration  on 
protein  dynamics  by  MD  simulations  using  a  generalized  Born 
(GB)  implicit  solvation  model  based  on  the  AMBER  force 
field.11  Similarly,  Feig21  examined  kinetic  properties  of  the 
alanine  dipeptide  and  selected  proteins  in  an  implicit  solvent 
modeled  by  the  generalized  Born  molecular  volume  (GBMV) 
method.910  Here  we  present  theoretical  studies  of  end-to-end 
loop  formation  dynamics  of  a  small  peptide  employing  different 
implicit  solvent  models  in  order  to  investigate  dynamical 
artifacts  introduced  by  implicit  solvents. 

The  end-to-end  contact  formation  of  amino  acid  residues  is 
one  of  the  elementary  processes  in  protein  folding  and  has  been 
extensively  studied  both  experimentally22-26  and  theoreti¬ 
cally.25'27-36  The  end-to-end  contact  formation  dynamics  of  the 
penta-peptide  Cys-Ala-Gly-Gln-Trp  (CAGQW)  has  been  ex¬ 
perimentally  studied  with  triplet  quenching.22  The  time  scale 
of  the  end-to-end  contact  formation  of  this  peptide  was 
determined  to  be  on  the  order  of  100  ns,  a  time-scale  which  is 
accessible  in  atomically  detailed  MD  simulations.  Extensive  MD 
simulations  of  the  peptide  in  explicit  solvent  with  an  aggregate 
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simulation  time  on  the  order  of  microsecond  have  been 
performed  with  both  AMBER9437  and  CHARMM2238  force 
fields.30  The  time  scale  of  the  end-to-end  contact  formation 
estimated  by  the  simulation  is  on  the  order  of  10  ns.  The 
discrepancy  of  time  scale  between  the  experiment  and  the 
simulation  was  attributed  to  the  finite  quenching  rate  of  the  Tip 
by  the  Cys  residue.  Indeed,  a  recent  fluorescence-based  mea¬ 
surement25  obtained  a  much  faster  time  scale  of  14  ns  for  the 
end-to-end  contact  formation  of  a  similar  penta-peptide  where 
the  Cys  of  the  CAGQW  peptide  was  replaced  by  an  efficient 
fluorescent  probe,  2,3-diazabicyclo[2.2.2]oct-2-ene-labeled  as¬ 
paragine  (Dbo).  Explicit  solvent  simulation  studies  by  Roccatano 
et  al.25,31  on  the  CAGQW  and  Dbo-AGQW  penta-peptides  with 
GROMOS9639  and  OPLS-AA/L40  force  fields  also  found  that 
the  simulated  dynamics  is  faster  than  the  experimentally 
measured  dynamics.  They  also  noted  that  the  force  field  had  a 
large  influence  on  the  dynamical  properties  on  the  peptide  based 
on  the  effective  size  of  the  bander  to  formation  of  the  compact 
structures.  Faster  end-to-end  formation  kinetics  was  observed 
for  GROMOS96  and  OPLS-AA/L  force  fields  as  compared  with 
AMBER94  or  CHARMM22  force  fields. 

In  this  study,  we  investigated  the  effect  of  treating  the  solvent 
implicitly  on  structural  and  dynamical  properties  of  the  CAGQW 
peptide  by  performing  extensive  MD  simulations  with  general¬ 
ized  Born  implicit  solvent  models  based  on  the  CHARMM22 
force  field  and  comparing  the  result  to  the  explicit-solvent 
simulation  using  the  same  force  field.30  We  observed  that 
structural  properties  of  the  peptide  sampled  by  explicit  solvent 
simulations  can  be  reasonably  represented  by  simulations  with 
generalized  Born  implicit  solvent  models.  However,  we  ob¬ 
served  a  significant  difference  in  dynamical  properties  of  the 
peptide  obtained  by  simulations  with  explicit  and  implicit  solvent 
models  when  we  employed  conventional  methods  for  the 
temperature  control.  This  difference  can  partly  be  removed  by 
performing  simulations  coupled  with  Langevin  dynamics  at  a 
friction  coefficient  of  10  ps-1,  as  has  been  noted  previously.20,21 

Methods 

We  performed  MD  simulations  of  the  CACQW  penta-peptide 
with  the  CHARMM  MD  simulation  program,41  version  c31b2. 
The  MMTSB  Tool  Set42  was  used  to  facilitate  the  simulations. 
As  in  the  experiments,22  the  C-terminus  of  the  peptide  was 
amidated  while  the  N-terminus  remained  unblocked.  We  used 
two  generalized  Born  implicit  solvent  models  available  in 
CHARMM,  GBSW  (generalized  Born  with  a  smooth  switch¬ 
ing)8  and  GBMV2  (generalized  Born  molecular  volume  version 
2). 10  In  the  GBMV2  model,  the  /3s-parameter  determines  the 
smoothness  of  the  molecular  volume  at  the  protein— solvent 
dielectric  boundary,  while  the  So-variable  is  adjusted  to  match 
Poisson  solvation  energies.  Here  we  used  the  original  set  of 
parameters  with  values  of  /3S  of  —20  and  So  of  0.7. 10,43  In  the 
GBSW  model,  three  parameters  are  specified,  a  smoothing 
length  w,  a  parameter  that  modulates  the  Coulomb  field  term 
a0,  and  an  empirical  correction  term  a\.  We  employed  the  set 
of  parameter  values  denoted  as  GBSW2, 43,44  which  were 
optimized  to  fit  the  Lee— Richards  molecular  surface45  Poisson 
results.  Thus,  we  set  the  smoothing  length  w  to  0.2  A  and  a0 
and  a\  to  1.2045  and  0.1866,  respectively.  We  also  performed 
simulations  with  the  parameter  set  in  the  original  GBSW  model 
denoted  as  GBSW1  with  w,  a0>  and  a\  of  0.3  A,  —0.1801,  and 
1.8174,  respectively.  The  atomic  input  radii  optimized  by  Nina 
et  al.46  and  Chen  et  al  47  have  been  used  for  simulations  with 
the  GBSW1  model.  The  input  radii  for  simulations  with 
GBMV2  and  GBSW2  were  taken  to  be  the  standard  CHARMM 


atomic  radii  as  these  models  were  optimized  with  these 
radii.10,43,44  These  implicit  solvent  models  have  been  successfully 
applied  in  recent  simulation  studies  to  calculate  thermodynamic 
properties  of  the  SH3  domain  of  a-spectrin48  and  the  absolute 
ligand  binding  free  energy  to  a  ribosome-targeting  protein.49 
The  nonpolar  solvation  energy  was  accounted  for  by  the  default 
energy  terms  expressed  as  the  linear  product  of  the  solvent- 
exposed  surface  area  of  the  solute  and  a  phenomenological 
surface  tension  coefficient  (y)  set  to  0.030  and  0.015  kcal/ 
(mol -A2)  for  the  GBSW2  and  GBMV2  models,  respectively. 
The  same  y  value  of  0.015  kcal/(mol  •  A2)  was  also  used  in  a 
previous  simulation  with  GBMV2  model  43  The  differences  in 
y  used  in  GBSW2  and  GBMV2  can  partly  be  attributed  to  the 
different  surface  representations  used  to  calculate  the  solvent 
exposed  surface  area.49  Two  different  y  values  of  0.030  and 
0.005  kcal/(mol"  A2)  used  in  previous  simulations8,47  with  the 
GBSW1  model  were  employed  to  gauge  the  effect  of  the  choice 
of  y  on  the  structural  properties  of  the  peptide.  We  also 
performed  MD  simulations  of  the  CACQW  penta-peptide  in 
dielectric  continuum  with  varying  dielectric  constants  e  of  1, 
4,  and  80.  We  used  data  from  a  previous  simulation  of  the 
CACQW  peptide  in  explicit  solvent  for  comparison  and  further 
analysis.30 

We  used  several  different  methods  for  temperature  control, 
a  Nose— Hoover  thermostat50,51  with  a  thermal  inertia  parameter 
of  50  kcal/(mol  ps2),  a  Berendsen  thermostat52  with  a  coupling 
constant  of  5  ps,  and  Langevin  dynamics53  with  friction 
coefficients  of  2,  10,  and  60  ps~'  applied  to  non-hydrogen  atoms. 
Although  the  true  friction  is  nonisotropic  and  atom  dependent, 
a  single  isotropic  friction  coefficient  was  applied  to  all  heavy 
atoms  for  simplicity.  A  MD  integration  time  step  of  2  fs  was 
used  except  for  Langevin  dynamics  simulations  employing  a 
friction  coefficient  of  60  ps-1,  where  a  shorter  time  step  of  1  fs 
was  required.  Distances  used  for  the  onset  of  damping  non- 
bonded  interaction,  the  cutoff  for  nonbonded  interactions,  and 
the  cutoff  for  nonbonded  list  generation  were  20,  22,  and  25 
A,  respectively.  For  a  small  5-residue  peptide  used  in  our  study, 
these  cutoff  distances  amount  to  calculations  without  any  cutoff 
distances.  Covalent  bonds  between  the  heavy  atoms  and 
hydrogens  were  constrained  by  the  SHAKE  algorithm.54 

The  fully  extended  conformation  of  the  CACQW  peptide  was 
used  as  a  starting  configuration  of  a  1  ns  MD  simulation  at  a 
temperature  of  700  K  controlled  by  a  Nose— Hoover  thermostat. 
Fifty  configurations  sampled  from  this  simulation  trajectory  from 
each  implicit  solvent  model  were  then  used  as  starting  configu¬ 
rations  for  50  independent  production  runs  at  300  K  for  the 
corresponding  implicit  solvent  model  studied  in  this  work.  Each 
of  the  50  simulations  lasted  21  ns  and  trajectories  were  sampled 
after  an  initial  100  ps  equilibration  unless  specified  otherwise. 
A  simple  equal  time-block  analysis  showed  no  significant 
differences  in  thermodynamic  and  structural  properties  calcu¬ 
lated  with  the  data  from  different  time  blocks,  indicating  that 
the  proper  equilibration  was  achieved.  Each  21  ns  simulation 
required  about  53.7  and  21.5  h  to  finish  on  a  single  core  of  3.0 
GHz  Intel  Woodcrest  processor  with  GBMV2  and  GBSW2 
models,  respectively.  Coordinates  were  saved  at  every  1  ps  for 
further  analysis. 

The  end-to-end  distance  (re)  of  the  peptide  was  defined  as 
the  distance  between  the  sulfur  atom  of  the  initial  cysteine  side 
chain  and  the  closest  non-hydrogen  atom  of  the  tryptophan 
indole  ring.  We  defined  end-to-end  contact  formation  as  taking 
place  when  the  end-to-end  distance  is  less  than  4  A.  The  same 
value  of  the  contact  distance  ( dc )  was  used  in  previous 
experimental22  and  computational30  studies.  We  also  used  contact 
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distances  estimated  from  simulations  for  the  analysis  of  end- 
to-end  contact  formation  dynamics.  Then  5opcn(f),  the  survival 
probability  of  the  open  noncontact  state  of  the  peptide  between 
time  0  and  t,  is  1  for  t  <  tc,  where  fc  is  the  first-contact  time, 
and  0  for  t  >  tc.  We  assumed  that  every  saved  noncontact 
configuration  (re  >  dc)  for  each  MD  simulation  run  is  a  valid 
starting  point.  The  reported  survival  probability  5open(f)  was 
estimated  by  the  averaged  survival  probabilities  over  all  starting 
points  of  the  respective  simulations.  The  average  contact 
formation  time  ropen  or  the  lifetime  of  the  peptide  in  the 
noncontact  state  was  estimated  by  calculating  the  mean  first 
passage  time  from  the  following  expression27'30 

ropen  =  /cTXpenW^  O 

Conversely,  we  can  define  the  survival  probability  of  the 
contact  state  5ciosed(f)  between  time  0  and  t,  as  1  for  t  <  ta,  where 
ta  is  the  first  contact-opening  time,  and  0  for  t  >  ta.  Integration 
of  this  survival  probability  yields  the  corresponding  rcioselj,  the 
lifetime  of  the  peptide  in  the  closed  contact  state.  The  contact 
formation  rate  kc  was  defined  as  ropen,  and  the  corresponding 
contact  opening  rate  k0  as  rddsed-  The  rate  constants  estimated 
from  this  definition  are  exact  only  when  5(f)  is  exponential  and 
approximate  otherwise. 

Clustering  analysis  of  structures  of  the  CAGQW  peptide  that 
form  end-to-end  contacts  was  performed  with  the  average- 
linkage  clustering  algorithm’’5  implemented  in  the  ptraj  module 
of  Amber,  version  10.56 

Error  bars  reported  in  this  work  were  based  on  the  95% 
confidence  interval  assuming  random  sampling  (1.96  times 
standard  error)  and  obtained  by  analysis  of  50  independent 
simulations  runs  for  each  model. 

Results  and  Discussions 

Structural  Properties.  End-to-End  Distance  Distribution. 

In  general,  solvation  in  aqueous  media  provides  the  solute  with 
the  capability  to  form  a  multitude  of  hydrogen-bonded  interac¬ 
tions  with  individual  water  molecules.  For  a  small  peptide,  this 
effect  is  typically  manifested  in  the  attenuation  of  intramolecular 
hydrogen  bonding  interactions.  Figure  1  shows  distributions  of 
end-to-end  distances  of  the  CAGQW  peptide  in  dielectric- 
continuum  simulations  with  dielectric  constants  £  of  1,  4,  and 
80  contrasted  with  the  results  obtained  from  using  explicit 
solvent  water  molecules.  In  the  vacuum  simulation  (£  =  1) 
where  the  Coulomb  interactions  are  not  shielded,  conformations 
characterized  by  strong  internal  hydrogen  bonding  between 
carbonyl  oxygens  and  N-terminal  hydrogens  dominated.  This 
is  shown  by  the  strong  preference  of  end-to-end  distances  located 
around  8  A  in  the  vacuum  simulation.  As  the  dielectric  screening 
is  increased  from  vacuum  to  a  simulated  general  hydrophobic 
environment  of  (e  =  4)  to  an  aqueous  environment  (e  =  80), 
the  strength  of  the  internal  hydrogen  bonding  formed  by  the 
peptide-groups  diminishes.  However,  the  simple  dielectric 
screening  model  is  not  capable  of  capturing  the  full  effect  of 
the  explicit  solvation.  In  particular,  the  high  dielectric  simulation, 
corresponding  to  an  aqueous  environment  (£  =  80),  overem¬ 
phasizes  the  contact  conformation  (re  <  4  A)  and  under¬ 
represents  the  extended  conformations  (re  ^  10  A).  The 
extended  conformations  in  the  high  dielectric  approximation 
essentially  lack  the  stabilizing  peptide/water  hydrogen  bonding 
that  occurs  in  the  explicit  solvent  model. 

Influence  of  Thermostats  on  Implicit  Models.  From  a 
statistical  mechanical  point  of  view,  MD  trajectories  are  used 


End-to-end  contact 


End-to-end  distance  re  (A) 

Figure  1.  P(re),  probability  distributions  of  the  end-to-end  distance 
(re)  of  the  CAGQW  peptide  obtained  by  simulations  with  explicit 
solvent  model  and  with  different  dielectric  continuum  models.  The 
distribution  obtained  with  the  explicit  solvent  model  is  shown  as  a  solid 
line.  The  results  for  the  explicit  solvent  model  shown  in  this  and  the 
following  figures  were  obtained  from  a  previous  simulation  of  the 
CACQW  peptide  in  explicit  solvent.30  Distributions  with  continuum 
models  with  dielectric  constant  £  of  1  (vacuum),  4,  and  80  are 
represented  by  a  dotted  line  and  lines  with  circles  and  squares, 
respectively.  A  dashed  vertical  line  ( re  =  4  A)  indicates  a  boundary  of 
end-to-end  contact  fomiation.  Error  bars  in  this  and  the  following  figures 
show  the  95%  confidence  interval  estimated  by  multiplying  1.96  to 
standard  errors  obtained  by  analysis  of  50  independent  simulations  runs 
for  each  model. 

to  generate  conformations  from  which  one  can  form  thermo¬ 
dynamic  averages.  Provided  that  the  sampling  is  ergodic  and 
that  the  time-step  used  in  the  simulations  is  small  enough,  the 
method  used  to  generate  the  trajectories  should  not  influence 
the  results.  In  practice,  for  smaller  systems  where  coupling  to 
solvent  degrees  of  freedom  can  dominate  the  mechanisms  of 
temperature  equilibration,  the  choice  of  thermostat  to  control 
the  temperature  can  and  will  influence  calculated  proper¬ 
ties.20'21’57,58  Figure  2a  illustrates  this  effect  for  distributions  of 
end-to-end  distances  calculated  from  MD  simulations  using  the 
GBSW2  implicit  solvent  model  with  five  different  thermostats. 

Overall,  the  distributions  of  end-to-end  distances  derived  from 
simulations  with  the  GBSW2  model  capture  the  major  features 
of  the  explicit  solvent  simulation  in  sharp  contrast  to  the 
dielectric  continuum  model  results  shown  in  Figure  1.  In 
particular,  the  implicit  model  qualitatively  captures  the  stabiliza¬ 
tion  of  the  extended  conformations  (re  S  10  A).  However,  the 
details  of  the  distributions  depended  on  the  choice  of  thermo¬ 
stats,  both  at  distances  near  the  contact  distance  and  at  extended 
distances.  In  simulations  with  simplified  models  without  explicit 
solvent.  Nose— Hoover  or  Berendsen  temperature  control  meth¬ 
ods  with  a  single  thermostat  were  shown  to  introduce  erroneous 
descriptions  of  thermodynamic  and  kinetic  properties  of 
systems.21’58'59  Figure  2a  shows  that  simulations  with  Nose- 
Hoover  or  Berendsen  temperature  control  methods  result  in 
slightly  increased  populations  of  more  compact  states  with 
smaller  end-to-end  distances  compared  to  simulations  with  the 
explicit  solvent  model  and  Langevin  dynamics.  It  was  recently 
shown  that  simulations  with  GBMV2  implicit  solvent  model 
combined  with  Langevin  dynamics  with  a  friction  coefficient 
of  10  ps_1  reproduced  kinetic  properties  of  peptides  and  proteins 
obtained  from  explicit  solvent  simulations  and  experiments.21 
From  our  simulations,  we  can  see  that  Langevin  dynamics 
thermostats  with  different  friction  coefficients  resulted  in  a 
generally  similar  distribution  of  end-to-end  distances  and  a  closer 
agreement  with  the  explicit  solvent  model.  However,  a  smaller 
time  step  of  1  fs  and  a  longer  equilibration  time  of  20  ns  were 
necessary  for  the  simulation  employing  a  Langevin  friction 
coefficient  of  60  ps  !  in  order  to  obtain  a  distribution  of  end- 
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(a)  End-to-end  contact  —  Explicit  solvent 


(b^  End-to-end  contact  —  Explicit  solvent 


Figure  2.  (a)  Probability  distributions  of  end-to-end  distance  of  the 
CAGQW  peptide  obtained  by  simulations  with  the  GBSW2  implicit 
solvent  model  using  different  temperature-control  methods.  The 
distribution  obtained  with  the  explicit  solvent  model  is  also  shown  as 
a  solid  line.  DisU'ibutions  obtained  with  Nose— Hoover  and  Berendsen 
thermostats  are  shown  with  lines  with  circles  and  squares,  respectively. 
Distributions  obtained  with  simulations  coupled  with  Langevin  dynam¬ 
ics  with  friction  coefficients  with  2,  10,  and  60  ps_1  are  represented  by 
lines  with  diamonds  and  triangles  pointing  up  and  left,  respectively. 
For  the  distribution  generated  with  a  friction  coefficient  of  60  ps^1,  we 
only  used  data  from  the  last  1  ns  of  each  21  ns  trajectory  due  to  the 
long  equilibration  required  for  this  simulation,  (b)  Comparison  of 
probability  distributions  of  end-to-end  distance  of  the  CAGQW  peptide 
obtained  by  simulations  with  explicit  solvent  (solid  line),  GBMV2  (a 
line  with  circles),  GBSW2  (a  line  with  triangles),  GBSW1  with  y  of 
0.030  kcal/(mol' A2)  (a  line  with  diamonds),  and  GBSW1  with  y  of 
0.005  kcal/(mol  •  A2)  (a  line  with  triangles).  The  results  from  the 
GBMV2,  GBSW2,  and  GBSW1  models  were  obtained  with  the 
Langevin  friction  coefficient  of  10  ps_1. 

to-end  distances  similar  to  those  obtained  with  smaller  friction 
coefficients.  Consistent  with  earlier  work,21  choices  of  the 
friction  coefficient  10  ps_1  or  below  tend  to  have  minimal  impact 
on  overall  structural  and  dynamical  properties.  Thus,  we  adopted 
Langevin  dynamics  with  a  friction  coefficient  of  10  ps_1  as  the 
method  of  choice  to  generate  ensemble  trajectories  for  general¬ 
ized  Born  implicit  solvent  models. 

GBSW2  versus  GBMV2  and  GBSW1  Models.  Figure  2b 
compares  distributions  of  end-to-end  distances  obtained  from 
GBMV2,  GBSW1,  and  GBSW2  implicit  solvent  models  to  the 
explicit  solvent  model.  Overall,  both  GBSW2  and  GBMV2 
implicit  solvent  models  showed  reasonable  agreements  of  end- 
to-end  distance  distributions  compared  with  explicit  solvent 
simulations.  Both  implicit  models  have  significant  populations 
at  extended  conformations  with  re  >  10  A.  However,  both 
implicit  models  consistently  generated  smaller  populations  of 
peptides  whose  end-to-end  distances  are  within  the  contact 
distance  of  4  A  as  compared  to  the  explicit  solvent  simulation. 
Further,  neither  model  reproduced  the  small  but  pronounced 
peak  near  the  end-to-end  distance  of  8.3  A.  This  peak  is 
characterized  by  peptide  conformations  where  the  end  residues 
Cys  and  Trp  make  simultaneous  contacts  with  the  Gin  residue 
in  the  middle.  These  interactions  are  diminished  in  the  implicit 
models;  in  particular  the  Cys/Gln  interaction  in  the  GBSW2 
model  is  not  properly  accounted  for.  In  contrast,  distributions 


of  end-to-end  distances  calculated  from  simulations  with  the 
GBSW1  model  overestimated  populations  within  the  contact 
distance  and  near  the  8.3  A  peak  and  underestimated  the 
population  of  extended  conformations.  Even  though  a  smaller 
phenomenological  surface  tension  coefficient  y  with  the  GBSW1 
model  increased  the  relative  population  of  extended  conforma¬ 
tions,  a  significant  difference  remained  between  distributions 
of  end-to-end  distances  with  GBSW1  and  explicit  solvent 
models. 

Dihedral  Angle  Distributions.  To  further  characterize  struc¬ 
tural  properties  of  the  peptide,  we  calculated  the  potential  of 
mean  force  (PMF)  associated  with  the  dihedral  angles  0  and  0 
of  the  three  middle  amino  acid  residues,  Ala,  Gly,  and  Gin. 
Figure  3  shows  PMF  surfaces  obtained  from  explicit  solvent 
simulations  together  with  those  obtained  from  GBMV2  and 
GBSW2  models.  For  Ala  and  Gin,  both  implicit  models  showed 
dihedral  angle  PMFs  similar  to  what  was  obtained  from  the 
explicit  solvent  simulation.  The  PMF  surface  of  the  Gly  residue 
located  in  the  middle  of  the  CAGQW  peptide  is  associated  with 
significant  populations  for  dihedral  angles  near  0  =  ±180 
degrees  and  ip  =  ±180  degrees,  consistent  with  the  extended 
conformations  shown  in  Figure  2b.  These  are  qualitatively 
similar  for  both  implicit  models  and  the  explicit  solvent  results. 
However,  there  are  small  differences  in  the  PMF  surfaces  that 
can  primarily  be  related  to  the  shape  of  the  extended  conforma¬ 
tions.  Overall,  we  obtained  a  good  agreement  between  PMF 
surfaces  of  dihedral  angles  calculated  with  implicit  and  explicit 
solvent  models.  The  CAGQW  peptide  studied  here  does  not 
have  any  charged  residues,  which  could  present  problematic 
salt  bridge  effects  as  observed  in  free  energy  landscapes  of  the 
C-terminal  /5-hairpin  of  protein  G  calculated  with  the  SGB 
implicit  solvent  model.14  In  addition,  both  GBMV2  and  GBSW2 
models  used  in  this  study  were  developed  to  reproduce  a 
dielectric  boundary  defined  by  Lee— Richards  molecular  sur¬ 
face,45  which  minimizes  the  overestimation  of  the  solvation 
energy  due  to  the  interstitial  high  dielectric  regions  that  occurs 
in  the  simplistic  treatment  of  the  dielectric  boundary. 10,43,60 
Indeed,  a  good  agreement  with  the  PMF  distribution  of  dihedral 
angles  calculated  with  the  explicit  solvent  model  was  also 
observed  in  other  studies  with  generalized  Born  models  where 
the  interstitial  high  dielectric  regions  are  properly  corrected.20,21,60 

End-to-End  Contact  Ensemble.  While  the  above  dihedral 
angle  analysis  showed  the  average  distribution  derived  from  all 
conformational  samples,  we  would  like  to  focus  on  the  differ¬ 
ences  exhibited  in  the  set  of  peptide  conformations  that  form 
the  compact  contact  state  re  <  4  A.  We  selected  1000  structures 
whose  end-to-end  distances  are  within  the  contact  distance  of 
4  A  from  the  peptide  conformations  generated  by  each  solvent 
model.  We  performed  clustering  analysis  on  these  structures  as 
described  in  Methods  and  selected  the  top  three  out  of  five 
representative  clusters.  The  top  three  clusters  typically  contained 
more  than  95%  of  tested  contact-conformations.  Figure  4  shows 
representative  end-to-end  contact  structures  from  these  clusters 
obtained  by  the  explicit  solvent  simulation.  These  contact 
structures  were  all  sampled  by  the  simulations  performed  with 
implicit  solvent  models  but  with  different  probabilities.  The 
structures  from  the  most  dominant  cluster  (65%  of  the  selected 
structures  belonged  to  this  cluster)  in  explicit  solvent  simulation 
(Cluster  1)  resembled  those  in  the  most  populated  cluster 
generated  by  the  GBMV2  model.  However,  structures  from  the 
second  most  dominant  (20%)  cluster  seen  in  the  explicit  solvent 
simulations  (Cluster  2)  resembled  those  in  the  third  largest 
cluster  in  simulations  with  the  GBMV2  model.  Structures  from 
the  most  populated  (54%)  clusters  with  the  GBSW2  model 
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Figure  3.  The  potential  of  mean  force  (PMF)  distributions  of  dihedral  angle  of  the  three  central  residues  of  the  CAGQW  peptide  in  explicit  and 
implicit  solvent  models.  The  PMF  was  calculated  from  —RT  log (P(<f>,ip)),  where  P(tp,ip)  is  the  probability  distribution  of  the  dihedral  angles  tp  and 
ip  normalized  by  its  maximum,  R  is  the  gas  constant,  and  T  is  the  temperature  (300  K). 


resembled  those  in  the  third  largest  (14%)  cluster  in  explicit 
solvent  simulations  (Cluster  3).  The  relative  importance  or 
weight  of  these  three  contact  cluster  conformations  could  not 
be  captured  by  the  implicit  models.  Figure  4  also  shows  dihedral 
angle  </>  and  xp  distributions  in  terms  of  PMF  surfaces  for  the 
three  representative  conformations  for  each  cluster.  The  dihedral 
angle  distribution  of  Gly  displays  a  distinct  pattern  for  each 
cluster,  essentially  relating  how  the  peptide  bends  slightly 
differently  to  form  the  three  distinct  contact  states.  The  Ala 
residue  in  Cluster  2  and  the  Gin  residue  in  Cluster  1  favor 
/3-strand  like  dihedral  conformations. 

Dynamical  Properties.  End-to-End  Contact  Formation 
Dynamics.  We  calculated  the  survival  probability  of  the  open 
(noncontact)  state  SopenCO  for  implicit-solvent  models  coupled 
with  Lange vin  friction  coefficient  of  10  ps~'  and  explicit  solvent 
simulation.  SopenCO  in  Figure  5a  shows  that  the  open  noncontact 
state  of  the  CAGQW  peptide  survived  longer  in  explicit  solvent 
than  in  implicit  solvents  models.  Average  lifetimes  of  the  peptide 
in  the  open  state  ropen  were  estimated  by  applying  eq  1  to  the 
time  dependent  survival  probabilities  in  Figure  5a.  This  yielded 
ropen  of  1.6,  0.9,  and  0.6  ns  for  explicit,  GBMV2,  and  GBSW2 
solvent  models,  respectively,  as  summarized  in  Table  1.  Relative 
magnitudes  of  r0pen  values  are  consistent  with  the  respective 
barrier  heights  of  the  transition  from  noncontact  to  contact  state 
in  the  potential  of  mean  force  (PMF)  distribution  of  end-to-end 
distances  shown  in  Figure  5b.  The  larger  the  barrier  the  longer 
the  average  lifetime  of  the  peptide  is  in  the  open  state.  In 
addition,  Table  1  lists  Topen  obtained  for  simulations  with 
different  thermostats  and  friction  coefficients.  r0pen  values 
estimated  from  implicit-solvent  simulations  coupled  with  Lan- 
gevin  dynamics  with  the  friction  coefficient  of  10  ps~'  are 
significantly  larger  than  those  obtained  with  Nose— Hoover  or 
Berendsen  thermostats  and  are  in  better  agreement  with  the 
explicit  solvent  simulation.  It  is  also  clear  that  the  choice  of 
the  friction  coefficient  determines  the  dynamics  of  the  contact 
formation  and  provides  a  heuristic  that  can  be  used  to  match 
the  explicit-solvent  dynamics  results.  We  also  calculated  the 


survival  probability  of  the  closed  (contact)  state  as  a  function 
of  time  (data  not  shown)  and  the  average  lifetimes  of  the  peptide 
in  the  closed  state  rciOSed  listed  in  Table  1. 

Unlike  structural  properties  of  the  peptide,  dynamics  of  end- 
to-end  contacts  are  sensitive  to  the  choice  of  the  contact  distance 
used  in  the  analysis.  Therefore,  we  calculated  survival  prob¬ 
ability  distributions  and  lifetimes  of  open  and  closed  states  of 
the  peptide  with  contact  distances  defined  as  the  end-to-end 
distances  corresponding  to  peak  positions  separating  open  and 
closed  states  in  the  PMF  distribution  shown  in  Figure  5b.  The 
contact  distances  according  to  this  definition  were  5.6,  6.1,  and 
5.7  A  for  explicit  solvent,  GBSW2,  and  GBMV2  models, 
respectively.  The  results  are  shown  in  Figure  5c  and  Table  2. 
Overall,  we  observed  faster  end-to-end  contact  formations  and 
slower  contact  openings  with  larger  contact  distances.  However, 
the  relative  speeds  of  contact  formation  and  opening  among 
different  solvent  models  remained  the  same.  With  this  definition 
of  contact  distances,  we  estimated  the  equilibrium  fraction  of 
contact  states  as  kj(kc  +  ka)  or  rciosed/(Topen  +  rclosed)  and 
compared  them  with  those  observed  in  the  simulations  in  Table 
2.  Considering  the  approximations  involved  in  characterizing 
complex  dynamics  of  formation  and  opening  of  various  end- 
to-end  contacts  with  single  rate  constants,  the  agreement  between 
estimated  and  observed  values  for  the  equilibrium  fraction  of 
contact  states  is  quite  good. 

Diffusion  Dynamics  Associated  with  the  End-to-End  Dis¬ 
tance.  To  further  understand  effects  of  different  solvent  models 
on  dynamical  properties  of  peptides,  we  analyzed  the  dynamics 
associated  with  end-to-end  fluctuations  on  short  time  scales. 
Feig21  considered  Ca  root-mean-square-deviation  (rmsd)  changes 
of  proteins  as  a  function  of  time  to  examine  differences  in  kinetic 
properties  based  on  solvent  models,  thermostats,  and  Langevin 
friction  coefficients.  In  a  similar  spirit,  we  calculated  mean 
square  displacement  of  end-to-end  distances  as  function  of  time 
to  estimate  relative  magnitudes  of  diffusive  motions  for  the 
solvent  models  studied  here.  As  the  diffusive  motion  of  re  is 
limited  by  the  extent  of  the  peptide,  it  is  the  short  time  behavior 
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Figure  4.  Representative  structures  of  the  three  most  populated  clusters  from  the  CAGQW  structures  that  form  end-to-end  contact  in  explicit 
solvent  simulations.  The  relative  populations  of  the  representative  structures  in  each  implicit  solvent  are  also  shown.  The  numbers  in  parentheses 
indicate  backbone  RMSDs  between  the  average  structure  of  a  cluster  as  generated  from  the  explicit  simulation  and  the  structures  generated  with 
implicit  solvents.  The  potential  of  mean  force  (PMF)  distributions  of  dihedral  angles  of  the  three  central  residues  of  the  CAGQW  peptide  conformations 
that  form  end-to-end  contacts  are  also  shown.  For  each  cluster,  the  data  from  explicit  and  implicit  models  are  combined  to  prepare  the  PMF 
distributions  of  the  dihedral  angles. 


that  allows  us  to  gauge  the  relative  speeds  of  diffusive  dynamics 
of  the  peptide. 

Figure  6a  shows  mean  square  displacements  of  the  end-to- 
end  distance  as  a  function  of  time  from  simulations  in  explicit 
solvent  and  with  the  GBSW2  implicit  solvent  model  with 
different  thermostats.  The  results  obtained  with  the  GBMV2 
model  using  the  same  Langevin  friction  coefficient  of  10  ps~‘ 
showed  virtually  identical  results,  and  for  brevity  we  limit  the 
following  discussion  to  the  GBSW2  model.  As  expected,  the 
mean  square  displacement  of  the  end-to-end  distance  is  not  linear 
with  respect  to  the  time  interval  even  at  short  time  scales  as  the 
internal  diffusive  motion  of  the  peptide  is  not  free  but 
constrained  by  intramolecular  covalent  bonds.  Therefore,  a 
simple  linear  regression  cannot  be  applied  to  obtain  a  diffusion 
coefficient  as  in  the  case  of  a  free  diffusion.  However,  the  mean 
square  displacement  of  the  end-to-end  distance  at  a  short  time 


scale  is  expected  to  provide  information  on  relative  magnitudes 
of  diffusive  motions  of  end-to-end  distances  in  different  solvent 
models.  For  example,  we  can  see  clearly  that  the  diffusive 
movement  in  explicit  solvent  is  the  slowest  with  a  mean  square 
displacement  of  0.6  A2  after  1  ps,  the  shortest  recorded  time 
difference.  This  is  consistent  with  the  slow  end-to-end  contact 
formation  dynamics  shown  in  Figure  5  and  Tables  1  and  2  for 
the  explicit  solvent  model. 

We  examined  the  dynamical  behavior  of  the  peptide  as  a 
function  of  the  friction  coefficient.  Figure  6b  shows  the  mean 
square  displacement  of  the  end-to-end  distance  after  1  ps  as  a 
function  of  the  friction  coefficient  (we  assigned  a  zero  friction 
coefficients  for  simulations  that  did  not  employ  Langevin 
dynamics).  This  clearly  illustrated  that  the  friction  coefficient 
plays  an  important  role  in  defining  the  internal  diffusive 
dynamics  of  the  peptide.  Nose— Hoover  or  Berendsen  thermo- 
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Figure  5.  (a)  Survival  probability  Sopen(?)of  the  open  noncontact  state 
calculated  with  explicit  (a  solid  line),  GBMV2  (a  line  with  circles), 
and  GBSW2  (a  line  with  squares)  solvent  models.  The  implicit  solvent 
calculations  were  done  using  a  Langevin  friction  coefficient  of  10  ps-1. 
The  contact  distance  of  4  A  was  used  in  the  analysis,  (b)  The  PMF 
distributions  of  end-to-end  distances  calculated  from  —  RT  log  ( P(re )/ 
r|),  where  P(re)  is  the  probability  distribution  of  the  end-to-end  distance 
rt  as  shown  in  Figure  2b.  The  PMF  distributions  were  shifted  along 
the  y-axis  so  that  minima  near  the  contact  distance  coincide  at  0.0  kcal/ 
mol.  (c)  Same  as  in  panel  a  but  analyzed  with  different  contact  distances 
listed  in  Table  2,  which  correspond  to  re  values  at  PMF  barriers  dividing 
open  and  closed  states. 

TABLE  1:  Lifetimes  of  Open  and  Closed  States  for  the 
Fixed  Contact  Distance  of  4  A" 


TABLE  2:  Lifetimes  of  Open  and  Closed  States  for 
Alternate  Contact  Distance  Values” 

Solvent  =  ^closed/ 

model  dc  (A)  T0pen  (ps)  rc|nse(i  (ps)  (r0pen  T  rC]0sed)  ,/c 

explicit  5.6  835(17)  60(21)  0.067(0.023)  0.069(0.009) 

GBSW2  6.1  322(24)  11.6(0.8)  0.035  (0.003)  0.067  (0.004) 

GBMV2  5.7  451  (38)  12.8  (2.0)  0.028(0.005)  0.044(0.005) 

“  The  contact  distance  dc  in  this  table  was  defined  as  end-to-end 
distance  corresponding  to  peak  positions  separating  open  and  closed 
states  in  the  PMF  distribution  shown  in  Figure  5b.  Errors  in 
parentheses  are  based  on  the  95%  confidence  interval  estimated  by 
1.96  times  standard  error.  Standard  errors  were  estimated  by 
analysis  of  50  independent  simulations  for  implicit  solvent  models. 
For  explicit  solvent  simulations  with  varying  durations,  standard 
errors  were  estimated  by  dividing  simulations  into  5  groups 
containing  10  simulations  each.  /„:  fraction  of  contact  states 

calculated  from  P(re). 


solvent  model  temperature  control  Topen  (ps)  rciosed(ps) 


explicit 

Berendsen 

1590(188) 

6.1  (1.4) 

GBSW2 

Nose— Hoover 

188  (22) 

2.7  (0.3) 

GBSW2 

Berendsen 

136  (9) 

2.1  (0.2) 

GBSW2 

Langevin  2  ps~* 

213 (12) 

1.8  (0.1) 

GBSW2 

Langevin  10  ps~* 

595  (43) 

2.3  (0.1) 

GBSW2 

Langevin  60  ps~* 

3759(1041) 

4.5  (0.5) 

GBMV2 

Langevin  10  ps~* 

876  (80) 

2.8  (0.2) 

“  Errors  in  parentheses  are  based  on  the  95%  confidence  interval 
estimated  by  1.96  times  standard  error.  Standard  errors  were 
estimated  by  analysis  of  50  independent  simulations  for  implicit 
solvent  models.  For  explicit  solvent  simulations  with  varying 
durations,  standard  errors  were  estimated  by  dividing  simulations 
into  5  groups  containing  10  simulations  each.  Topen  and  Tciose[i  are 
average  lifetimes  of  open  and  closed  states,  respectively.  The  results 
for  the  explicit  solvent  model  in  this  and  the  following  table  were 
obtained  from  the  previous  simulation  of  the  CACQW  peptide  in 
explicit  solvent.30 

stats  yielded  relatively  faster  local  diffusive  motions  with 
significantly  larger  mean  square  displacement  at  1  ps  than 
obtained  with  the  explicit  solvent.  However,  using  these 
thermostats  with  implicit  solvent  models  could  lead  to  artifacts 
in  calculated  structural  properties  as  shown  in  Figure  2a.  In 
addition,  Feig21  observed  that  the  use  of  a  Nose— Hoover 
thermostat  instead  of  Langevin  dynamics  greatly  enhances  local 
conformational  sampling  but  slows  down  the  crossing  of 
barriers,  which  may  be  important  in  global  sampling,  by  at  least 
an  order  of  magnitude,  due  to  the  lack  of  solute— solvent 


Figure  6.  (a)  Mean  square  displacement  of  end-to-end  distance  re 
between  t  and  f  +  At  as  calculated  from  the  trajectories  generated  with 
the  GBSW2  model.  For  comparison,  the  explicit  solvent  values  are 
indicated  by  the  solid  line.  Lines  with  circles,  squares,  and  diamonds 
represent  results  obtained  from  the  implicit  model  using  a  Nose— Hoover 
thermostat,  a  Berendsen  thermostat,  and  Langevin  dynamics  with  a 
friction  coefficient  of  10  ps-1,  respectively,  (b)  Mean  square  displace¬ 
ment  of  end-to-end  distance  rc  between  t  and  f  +  1  ps  as  a  function  of 
the  friction  coefficient.  The  same  types  of  lines  or  symbols  used  in 
panel  a  have  been  adopted  to  represent  results  with  different  thermostats. 


stochastic  collisions.  Figure  6b  shows  that  a  smaller  Langevin 
friction  coefficient  of  2  ps~*  could  achieve  a  similar  degree  of 
local  conformational  sampling  obtained  with  Nose— Hoover  or 
Berendsen  thermostats.  As  was  observed  in  a  recent  simulation 
study  with  GBMV2,21  implicit-solvent  simulations  coupled  with 
Langevin  dynamics  with  an  applied  friction  coefficient  of  10 
ps~*  give  a  reasonable  agreement  with  the  explicit  solvent  result. 
In  a  comparative  study  of  generalized  Born  models,  good 
agreements  between  calculated  and  experimentally  derived 
structural  properties  were  also  obtained  by  simulations  with  the 
10  ps~'  friction  coefficient.  On  the  basis  of  these  considerations, 
Langevin  dynamics  with  an  applied  friction  coefficient  of  10 
ps-1  or  smaller  would  be  a  reasonable  choice  for  a  thermostat 
in  implicit-solvent  simulations. 

Figure  7a  shows  the  mean  displacement  of  the  end-to-end 
distance  after  1  ps  as  a  function  of  its  starting  value.  There  is 
a  good  agreement  between  distributions  of  the  mean  displace- 
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Figure  7.  (a)  Mean  displacements  associated  with  end-to-end  distance 
re  of  the  peptide  between  t  and  t  +  1  ps  as  a  function  of  starting  re 
distance.  A  line  with  circles  represents  results  with  the  GBSW2  implicit 
solvent  model.  The  solid  line  represents  results  from  simulations  with 
explicit  solvent,  (b)  Mean  square  displacements  associated  with  end- 
to-end  distance  re  of  the  peptide  between  t  and  t  +  1  ps  as  a  function 
of  starting  re  distance. 

ment  calculated  from  explicit  and  GBSW2  implicit  solvent 
models.  At  small  and  large  starting  end-to-end  distances,  the 
mean  displacement  is  positive  and  negative,  respectively, 
corresponding  to  a  reflective  motion  in  the  bound  potential. 
Deviation  from  zero  mean  displacement  indicates  that  dynamics 
of  end-to-end  distances  is  influenced  by  conformation  specific 
preferences  rather  than  random  diffusive  motions.  Except  for 
large  (re  >  15  A)  or  small  (re  <4  A)  end-to-end  distances  where 
the  population  density  is  low,  the  mean  displacement  is  close 
to  zero,  which  indicates  that  internal  dynamics  on  the  shorter 
time  scale  of  1  ps  is  largely  diffusive. 

Figure  7b  shows  the  mean  square  displacement  of  the  end- 
to-end  distance  after  1  ps  as  a  function  of  its  starting  value.  As 
can  be  seen,  the  random  diffusive  motion  associated  with  the 
end-to-end  distance  increases  close  to  the  barrier  and  slows  near 
contact  conformations  of  the  peptide  compared  to  the  average 
value  (Figure  6b).  At  the  barrier  (re  ~  6  A),  dynamics  are 
accelerated  as  the  peptide  seeks  a  lower  energy  conformation 
by  either  closing  or  opening  up  the  peptide.  In  the  contact 
conformation  (re  ~  4  A),  the  peptide  is  energetically  trapped 
and  the  motion  restricted,  which  is  reflected  in  the  lowered  mean 
square  displacement.  In  general,  the  implicit  model  exhibits 
greater  local  variations  than  the  explicit  solvent  model.  This  is 
compatible  with  the  faster  dynamics  observed  in  the  contact 
and  noncontact  lifetimes  for  the  implicit  models.  Ultimately, 
the  dynamics  is  governed  by  the  potential  energy  landscape, 
and  the  faster  dynamics  exhibited  by  the  implicit  model  are 
indicative  of  a  less  rugged  energy  landscape.20 

Summary  and  Conclusions 

In  this  study,  we  investigated  structural  and  dynamical 
properties  of  a  small  peptide  CAGQW  with  different  solvent 
models.  Simulations  with  the  GBSW2  and  GBMV2  implicit 
solvent  models  showed  similar  distributions  of  end-to-end 
distances  and  dihedral  angles  as  with  the  explicit  solvent  model. 
A  close  examination  of  end-to-end  contact  conformations  of 
the  peptide  revealed  that  representative  structures  observed  in 
explicit  solvent  were  sampled  by  implicit  solvent  models  but 
with  different  relative  probabilities.  These  calculations  were 
based  on  the  same  CHARMM22  force  field.  This  is  in  contrast 


with  the  small  overlap  (2%  within  2  A  rmsd)  between  end-to- 
end  contact  conformations  observed  in  explicit  solvent  simula¬ 
tions  using  two  different  force  fields  (AMBER94  and 
CHARMM22).30 

We  observed  significant  differences  in  dynamical  properties 
of  peptides  between  simulations  performed  by  explicit  and 
implicit  solvent  models.  The  peptide  in  explicit  solvent  displayed 
the  slowest  dynamics  in  both  contact  formation  and  the  intrinsic 
diffusive  motion.  The  slow  internal  diffusive  motion  of  the 
peptide  in  explicit  solvent  could  reasonably  be  accounted  for 
in  implicit  models  by  using  Langevin  dynamics  and  applying  a 
friction  coefficient  of  10  ps~'.  However,  details  related  to  the 
energy  landscape  between  the  explicit  and  implicit  model  drive 
the  observed  difference  in  dynamical  properties. 

The  study  of  the  small  peptide  presented  here  allowed  us  in 
great  detail  and  with  high  confidence  to  calculate  the  differences 
between  explicit  and  implicit  solvent  models.  Our  study 
highlighted  the  similarities  and  differences  that  can  be  expected 
when  using  a  generalized  Born  implicit  solvent  to  model 
structural  and  dynamical  properties  of  peptides  and  proteins.  A 
more  detailed  study  with  larger  peptides  and  diverse  set  of  amino 
acid  sequences  would  be  needed  for  a  thorough  comparison  of 
explicit  and  implicit  solvent  models.  We  noted  that  structural 
and  dynamical  properties  obtained  from  implicit  solvent  simula¬ 
tions  are  quite  sensitive  to  details  of  the  calculation,  such  as 
the  choice  of  thermostat.  However,  we  also  affirmed  that 
simulations  with  GBMV2  or  GBSW2  implicit  solvent  models 
coupled  with  Langevin  dynamics  with  a  friction  coefficient  near 
10  ps~'  can  give  good  agreements  with  explicit-solvent 
simulations. 
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